Automatic Conversion of Dialectal Tamil Text to Standard Written Tamil Text Using Fsts Rules, Analogy, and Social Factors Codetermine Past-tense Formation Patterns in English Revisiting Word Neighborhoods for Speech Recognition
نویسندگان
چکیده
Word neighborhoods have been suggested but not thoroughly explored as an explanatory variable for errors in automatic speech recognition (ASR). We revisit the definition of word neighborhoods, propose new measures using a fine-grained articulatory representation of word pronunciations, and consider new neighbor weighting functions. We analyze the significance of our measures as predictors of errors in an isolated-word ASR system and a continuous-word ASR system. We find that our measures are significantly better predictors of ASR errors than previously used neighborhood density measures.
منابع مشابه
Automatic Conversion of Dialectal Tamil Text to Standard Written Tamil Text using FSTs
We present an efficient method to automatically transform spoken language text to standard written language text for various dialects of Tamil. Our work is novel in that it explicitly addresses the problem and need for processing dialectal and spoken language Tamil. Written language equivalents for dialectal and spoken language forms are obtained using Finite State Transducers (FSTs) where spok...
متن کاملRules, Analogy, and Social Factors Codetermine Past-tense Formation Patterns in English
We investigate past-tense formation preferences for five irregular English verb classes. We gathered data on a large scale using a nonce probe study implemented on Amazon Mechanical Turk. We compare a Minimal Generalization Learner (which infers stochastic rules) with a Generalized Context Model (which evaluates new items via analogy with existing items) as models of participant choices. Overal...
متن کاملTamil IT ! : Interactive Speech Translation in Tamil
The Tamil IT! (Interactive Translation) speech translation system is intended to allow unsophisticated users to communicate across the Tamil ↔ English language barrier, without strong domain restrictions, despite the error prone nature of current speech and translation technologies. Achieving this ambitious goal depends in large part on allowing the users to interactively correct recognition an...
متن کاملTranslating Tamil Speech (SL) as English Text Message (TL) in Android Mobile Phones
Mobiles phones are used every nook and corner and every man, hence innovative technological applications are needed. Moreover, in the scenario of android mobiles not only professionals but even common users expect ample innovations. The paper focuses on translating Tamil speech (SL) as English text message (TL). Even though there are some applications used for translating SL to TL, its one step...
متن کاملThe Impact of Input Enrichment in Long Text vs. Short Texts on Grammatical Accuracy in Writing Among Elementary Language Learners
This study was conducted to investigate the influence of teaching accurate grammar inwriting via enriched long text and short text for the elementary students atShokouhe_Farhang institute. The homogenized subjects were divided into two groups of 18and 17 participants. Using a writing exam as a pretest in order to check the students’knowledge in English past tense. The control group received the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014